# Large Model Inference
Medgemma 27b Text It 4bit
Other
MedGemma-27B-Text-IT-4bit is an MLX-format model converted from Google's MedGemma-27B-Text-IT model, specifically optimized for medical and clinical reasoning tasks.
Large Language Model
M
mlx-community
193
3
Parakeet Tdt 0.6b V2 Onnx
NVIDIA Parakeet TDT 0.6B V2 is a model based on automatic speech recognition (ASR) tasks, suitable for English speech-to-text tasks.
Speech Recognition English
P
istupakov
129
3
Rank1 32b
MIT
rank1-32b is an information retrieval reranking model based on Qwen2.5-32B, which judges relevance by generating reasoning chains
Large Language Model
Transformers English

R
jhu-clsp
18
0
Meta Llama 3.3 70B Instruct AWQ INT4
Llama 3.3 70B Instruct AWQ INT4 is the 4-bit quantized version of the Meta Llama 3.3 70B Instruct model, optimized for multilingual dialogue use cases and text generation tasks.
Large Language Model
Transformers Supports Multiple Languages

M
ibnzterrell
6,410
22
Llama 3 8B Instruct QServe G128
Llama 3 is the next-generation open-source large language model introduced by Meta, featuring enhanced performance and broader application scenarios.
Large Language Model
Transformers

L
mit-han-lab
197
2
Featured Recommended AI Models